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Abstract We review the literature on the localization transition for the class of poly- 
mers with random potentials that goes under the name of copolymers near selective 
interfaces. We outline the results, sketch some of the proofs and point out the open 
problems in the field. We also present in detail some alternative proofs that simplify 
what one can find in the literature. 
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1 Copolymers and selective solvents 
1.1 A basic model 

In iTTJl , T. Garel, D. A. Huse, S. Leibler and H. Orland introduced a simple model in 
order to look into how the statistical behavior of macromolecules can be strongly af- 
fected by randomness in the physico-chemical properties of their constituents. They 
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aimed at a special class of macromolecules of linear type, the random hydrophilic- 
hydrophobic copolymers, in a medium of water and oil, separated by an interface. 
Such a polymer chain is just made up of monomers that differ for their affinity for 
water (or oil). The affinity is reduced to a real parameter that we call charge: the 
charge of the j'-th monomer is denoted in [13] by and, in mathematical terms, 
{Cj}j=i,2,... is an IID sequence of Gaussian random variables, with given mean and 
variance. In order both to conform with the mathematical literature and to generalize 
slightly the problem we will write £ ; as C0j + h, where h € K (or h > as we will do 
next) and CO = {(Dj}j=i2,... is an HD sequence of random variables (often referred 
to as the disorder) such that 

M(f) := E[exp(ra>i)] <°°, (1) 

for every fGl and such that Ec»i = 0, Ea)^ = 1 . Apart for the larger class of charges 
that we allow and for notations, the Hamiltonian of the polymer model set forth in 
03 is 

N 

H N , m ,h(S) ■= E {(On+h)sign(S n ) , (2) 

n=l 

where N is the length of the polymer and S = {Sq, Si ,. ..} is a simple symmetric 
random walk trajectory (So = 0, {S n+ \ — S' n } n= o,i ... is a sequence of independent 
identically distributed symmetric random variables that take only values ±1: the 
law of S is denoted by P and we stress that CO and S are independent). 

We invite the reader to have a look from now at Figure [T] for the directed poly- 
mer interpretation of the trajectories of the model. A small detail to deal with is 
sign(0): sign(5„) should be read as sign(5„_i) when S n = and this convention is 
particularly natural in directed polymer terms, because sign(S„) is +1 (—1) if the 
71 th monomer is in the upper (lower) half plane, that is in oil (water), see Figure [I] 
Still to conform with most of the mathematical literature on copolymers, the inverse 
temperature is denoted by A(> 0) instead of the more customary j3, so that the 
Boltzmann factor that defines the polymer model of length N is exp(A/f/y m h(S))- 
We are interested in the quenched system so we underline the very different nature 
of the two sources of randomness: CO is chosen once for all at the beginning of the 
experiment (the hydrophilic or hydrophobic character of the monomers does not 
change, while the chain fluctuates). 

At a superficial level the effect of the charges on the polymer is quite intu- 
itive: for X > positively charged monomers (co„ + h > 0, that is hydrophobic 
monomers) prefer lying in the upper half-plane (oil) and the opposite is true for 
the negatively charged ones. But for large these energetically favorable trajec- 
tories become more and more atypical for P since placing the monomers in their 
preferred solvent strongly reduces the fluctuation freedom of the chain. We are 
therefore dealing with an energy-entropy competition that in the limit — > °° leads 
to a localization-delocation transition: localization arises when energy prevails and 
the polymer sticks to the oil-water interface, visiting thus both oil and water, while 
derealization corresponds to the case in which the polymer prefers to stay away 
from the interface. We will come back to this with much more details, but we an- 
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Fig. 1 The upper part of the figure shows a copolymer configuration: each bond, or segment, 
[(n — l,S n _i), («,£„)] of the random walk trajectory represents a monomer, so that sign(S„) should 
be read as +1 (resp. —1) if this monomer is in the upper (resp. lower) half-plane. Note that the 
Hamiltonian of the copolymer, cf. {2|, does not depend on the details of S within an excursion, 
but only on the length and sign of the excursion. This naturally leads to the generalized model 
introduced in section |1.2| (lower part of the figure) in terms of a general discrete renewal process 
T and a sequence of independent signs t, . It is mathematically useful to use the variable A„ , that 
in the random walk case is just (1 — sign(S„))/2, i.e. the indicator function that the copolymer is 
below the axis. Note in fact that, by (2), H Nm j,(S) = — 2£jLj (coy, + h)A„ + c N (a>), with c N (co) = 
Y.n=i( m n + h) which does not depend on S (or A), therefore we can drop it without changing the 
polymer measure (that is what we do in |4)). 



ticipate that this entropy-energy competition turns out to be rather challenging: the 
phase diagram of the model is for the moment only partially understood and sound 
conjectures (or even only convincing physical heuristics) are lacking on several fun- 
damental issues. This may appear rather surprising in view of the very simple nature 
of the model and of the fact, at the heart of the motivation of lfl3l . that it represents 
one of the simplest instances of a general mechanism that plays a crucial role in a 
variety of extremely important phenomena (protein folding, to name one). 



1.2 The (general) copolymer model 

As argued in Figure [T] and its caption, the basic copolymer model does not depend 
on all the details of S, but just on its zero level set, which is a renewal set, and on the 
sign of the excursions, that is simply an independent fair coin tossing sequence. It 
is therefore natural, and at times really helpful, to look at the following generalized 
framework. Let us consider a discrete renewal process T = {t„}„>o on the non- 
negative integers N U {0}, i.e., a sequence of random variables such that To = 
and {tj+i — T,-} , = o i ... is an IID sequence of positive integer-valued variables, with 
marginal law satisfying 
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K{n):=n^=n) n ^^, (3) 

where a > and Ck > (we write /„ ~ g„ for f„/g„ — >• 1). Since we are dealing with 
renewal processes it is important to stress that P(ti < °°) = 1, that is Y^=\ K(n) = 1, 
so that T is a persistent renewal. More generally, one could replace the constant 
in |3]l by a slowly varying function L(n), but we stick for simplicity to the purely 
polynomial asymptotic behavior (|3). It is well-known that the first return time to zero 
of the simple symmetric random walk inf{n > : S n — 0} satisfies ([3]l (restricted to 
the even integers, due to the usual periodicity issue) with a = j. In particular, the 
basic model presented in the previous subsection is a special case of the generalized 
copolymer model we are defining, as it will be clear in a moment. 

Remark 1. It is practical to switch freely from looking at T as a sequence to con- 
sidering it a random set, so for example |tH [0,N]\ is the number of renewals up to 
TV, or n E T is the event that there exists j such that Xj = n. For a comprehensive 
references on renewal processes see for example 12. 

The renewal T identifies the polymer-interface contacts: we still need to know 
whether the excursion is above or below the axis. For this let ^ = {^„}„ g n denote 
an IID sequence of B{\ /2) variables (that is P(^„ = 0) = P(^„ = 1) = j) indepen- 
dent of T, that we still call signs. Starting from the couple (t,^) we build a new 
sequence A = {A n } neN by setting A n = E7=i^l(T f _i,T J -]( w ). in analogy with the 
simple random walk case: the signs A„ are constant between the points in T and 
they are determined by £, . 

By copolymer model we mean the probability law Pn.w = Pjv.o.A./i f° r the se- 
quence A defined by 

dI V ( 4 ) := _L exp ( -2A £ A n (co n + h) ) , (4) 



dP v ' Z, 



where JVeN, A>0, /igl (but we can and will assume h > without loss of 
generality) and (0 = {o„}„gn has been introduced in § [T] The partition function 

ZN,m=Z N>w i h is given by 



yv.fi) 



E 



exp — 2A AnlcOn + h) 



n=l 



(5) 



In order to emphasize the value of a in ((3), we will sometimes speak of a a- 
copolymer model, but Pn.w depends on the full distribution K(-), not only on a. 



1.3 The free energy: localization and derealization 



We introduce the free energy of the copolymer by 
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f(A,/z) := lim F N (X,h), where F N (X,h) := -J- E \\ogZ N m x J . (6) 

The existence of such a limit follows by a standard argument based on super- 
additivity, see for example [ 14] or [ 15 J Ch. 4], where it is also proven that 

F(X,h) = lim ^\ogZ NmXMl P(dffl)-a.s.andinZ 1 (P). (7) 

Equations |6])-((7]i are telling us that the limit in (|7]i does not depend on the (typical) 
realization of CO, however it does depend on P, that is on the law of (0\, as well as 
on the inter-arrival law K(-). This should be kept in mind, even if we omit P and 
K(-) from the notation f(A,/z). Let us point out from now that f(A,-) and F(-,h) 
are convex functions, since they are limits of convex functions. As a matter of fact 
f(-, •) is only separately convex because of the choice of the parametrization, but it 
is straightforward to see that (X,h) H> F(X,h/X) is convex. 

Remark 2. It is sometimes useful to consider the constrained partition function 
Z N,m = Z NwXh of the mode1 ' defined by 



Z N,a,X,h '■— E 



exp ( -2A Y,(a>n+h)A„j 1{wgt} 



(8) 



which differs from |5]) only by the boundary condition factor l^ez}- It i s a standard 
fact JT3J Remark 1 .2] that for all N, X, h we have 

Z N : (o,X,h ^ Z N,m,X,h < CNZ N,co,X,h > ( 9 ) 

where C is a positive constant. In particular, the free energy F(X,h) does not 
change if Z N m X h is replaced by Z^ . , in |6]) and |7]). Furthermore, since i-> 
E(logZ^, m x h ) is a real super-additive sequence, we can write 

F(A '^ = ^ ^ E ^ Z N,o>,l,k) = supiE(logZ= Xh ). (10) 



A crucial observation is: 



F(X,h) > for every X,h > 0. (11) 

This follows by restricting the expectation in ([5]) to the event {fi > N,%\ = 0}, on 
which we have A\ =0, . . . , An = 0, hence we obtain Z^.m > ^{^i > N) and it 
suffices to observe that A^ 1 logP(Ti > N) vanishes as — >• °°, thanks to Q. Notice 
that the event {ti > N, %\ = 0} corresponds to the set of trajectories that never visit 
the lower half plane, therefore the right hand side of ( fTT| may be viewed as the 
contribution to the free energy given by these trajectories. 

Based on this, we say that (X,h) £ @ (delocalized regime) if F(X,h) = 0, while 
(A, h) <E J? (localized regime) if F(X,h) > 0. This may look at first as a cheap way 
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to escape from the real localization/delocalization issue, that is inherently linked to 
the path properties of the measure P# eo, but it is not the case. Notice in fact that, if 
h i — ^ v(k,h) is differentiable (which fails at most for a countable number of values 
of h, by convexity), by differentiating (|7]i and by convexity arguments we have 



=l-37F(A,/z) = lim Eat.g) 
Lk ah 



N 



-a.s., 



(12) 



where 



N 
n=\ 



(13) 



is just the total number of monomers in the lower half-plane, that is in water (cf. 
Figure[T]i. Therefore if (A , h) is chosen in the interior of $>, where F = 0, the polymer 
visits water with null density (Jfa/N —> 0). On the other hand, if (X,h) is in ££ , 
the polymer puts a positive density, precisely —(l/(2X))dF(X,h)/dh G (0,1), of 
monomers in water and the rest, still a positive density, in oil. 

We will deal below, cf. section [5] with sharper results on path behavior, but the 
elementary observation we have just made shows that the definition we have set 
forth of localization and derealization is far from being artificial. As a matter of 
fact, it is the natural physical definition, and in fact it has been used already in |[T3l . 
while in the mathematical literature was first introduced by Q. 



1.4 The phase diagram 



Convexity and the evident mono tonicity of f(A, •) put strong a priori constraints on 
the phase diagram: let us go through this before going toward sharper questions. We 
can set 

h c {X) := sup{/i: f(A,/i) >0}, (14) 

and the monotonicity of f(A, •) guarantees that (A, h) £ Jz? if and only if h < h c {X), 
namely that h c {-) is the critical curve. Let us derive a number of elementary proper- 
ties of h c (-). 

The fact that h c (X) < °° for every A follows by the standard annealed bound: 



ElogZw.,, < logEZ;v. m = log EE 



exp — 2A E An{(O n +h) 
. \ »=i 

) g EexpKlogM(-2A)-2A/0£z\„J , (15) 



so that F(X,h) < (hence, recall ((TT), F{X,h) = 0) if h > logM(-2A)/(2A), and 
log M(-2A)/(2A) < °o for every A by (l). 
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Remark 3. The exponential of the rightmost term in (JT3J is the partition function 
of the annealed model associated to our quenched model. The free energy of the 
annealed model is rather trivial: it is in fact an elementary exercise to see that 

Urn ilogEexp^(logM(-2A)-2A/0^A^ = (logM(-2A) -2Xh)+, (16) 

where a + := al a >o- The annealed model has therefore a (de)localization transition 
too: its critical curve is h" nn (X) := logM(— 2A)/(2A) and we have just remarked 
that h c (X) < h" n "(X) (this inequality on the critical curves is also referred to as 
annealed bound). It can be noticed also that the annealed free energy is not C at 
criticality, that is the transition is of first order. Let us stress that the annealed free 
energy looses a lot of details of the original model (in particular: no trace of K (■)]). 

We have also h c (X) > as soon as A > 0, but this is not a trivial statement. In fact 
this means showing that f(A,0) > for every A > 0: in more dramatic terms, if the 
interaction does not select, on the average, a preferred solvent (h = 0), the polymer 
is localized even at arbitrarily weak coupling, a result established first in |26|. We 
skip the proof of this fact (in section [2] the proof of stronger results is sketched) 
and we simply observe that, together with the annealed upper bound, it implies that 
h c (X) — >0asA\0. 

At this point convexity can be used in a very profitable way: since {(A,y) : 
f(A, y/X) < 0)} is a convex set, its lower boundary A M> Xh c (X) is a convex func- 
tion. So we can write h c {X) = g(X)/X, with g(-) convex such that g(X) = o(X) as 
A \ 0. This directly implies in particular continuity of h c (-) and, with a little bit 
more of work, also the fact that h c (-) is strictly increasing J4). 

Quite a bit of effort has been put into pinning down the value of h c (-). Figure [2] 
sums up the results that are known on h c (-) and, in particular, the content of 

Theorem 1. For every A > the following explicit bounds hold: 

' logM(-2A/(l+a)) < h c (X) < -LlogM(-2A) = h« m {X), (17) 



2A/(l + a) v 2A 

where the left inequality is strict when a > 0.801 (at least for X small). 

The lower bound in (F7\ is proved in |4|. The strict inequality in the upper bound in 
(17) was first proved in 1 30 1 to hold for large A and then extended to every A > in 
1 3. In section|3]we give an alternative, more direct proof. 

Highlighted in Figure [2 is the small A behavior of h c (X). In fact, as more exten- 

sivily explained in section 4 for a € (0, 1) we have h c (X) ~ m a X, with m a > 
depending only on a. The slope m a is therefore a universal feature of the model: 
it does not depend on the details of the disorder sequence co and of the underlying 
renewal T. The proof of such a result goes through showing that for A and h small 
the free energy of the copolymer is close to the free energy of a suitable continuum 
polymer model. 
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Fig. 2 On the left there is a sketch of the phase diagram and the critical curve X i-> h c {X). The 
localized (resp. delocalized) regime corresponds to {X,h) lying strictly below (resp. above) the 
critical curve. Explicit upper and lower bound on h c (-) are known, cf. Theorem[TJ and they are 
schematically drawn as dashed lines. On the right one finds a zoom of the region near the origin, 
where the critical curve is close to a straight line and essentially all the relevant information close 
to the origin (weak coupling regime) is encoded in the slope of this line: while the details of the 
critical curve do depend on the law of the disorder ft) and on the details of K(-), the slope depends 
only on a. This is an important universality feature to which section[4]is devoted. 



It becomes therefore quite relevant to get a hold of the value of m a (at least for 
a € (0, 1)). As a matter of fact, from ( fT7| i one directly extracts 



1 + OS 



< m a < 1 , for every a > . 



(18) 



but this result can be sharpened to 



,1 g(«) 1 

max - , , , 

.2 y/T+a l + a 



< m a < 1 , 



(19) 



where g(-) is a continuous function (of which we have an expression in terms of 
the primitive of an explicit function) such that g(oc) = 1 for a > 1 and for which 
one can show that g(a) / y/T+a > 1/(1 + Of) for a > 0.801 (by evaluating g(-) 
numerically one can extend this result to a > 0.65). Since the existence of m a is not 
guaranteed for a > 1, to be precise both in ([18) and in ( fT9] ) m a should be replaced 
by the inferior and superior limits of h c (A ) /A . 

Remark 4. For sake of conciseness we have left aside the a = 0, that would require 
replacing the power law behavior Q with a regularly varying behavior (allowing, in 
particular, the presence of logarithmic multiplicative corrections). The bounds that 
we have just presented directly generalize lTT5l Ch. 6] and for a = the three terms 
in ( [T7] i coincide. 



We would like to stress that Theorem [T] and the bounds ( fT9| ) show that some 
claims in the physical literature are wrong. Notably in lfl3l l32l it is claimed, for 
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a = 1/2, that h c (-) coincides with h " {■) (and ( fT9] l shows that also the weaker 

claim that h c {X) lif^ik) is not correct. In ||28ll23l it is claimed that, still for 
a = 1/2, the inequality in the left-hand side of ( fT7| ) is an equality. Theorem [T] falls 
short of proving that also this claim is false (even if it suggests it). A numerical 
study [9 1 lead for a = 1/2, complemented by a careful statistical analysis using 
concentration inequalities, strongly suggests that the lower bound in (TJ\ is strict 
and that the critical curve is somewhat halfway between the lower and the upper 
bound. 

What the previous results and discussion expose is that a convincing heuristic 
theory predicting the location of the critical curve, or just its slope at the origin, is 
lacking. In this sense we consider that capturing the value of m a , for a £ (0, 1), is 
an important open problem. Since computing explicitly quenched quantities may be 
really out of reach, here are two sub-problems that are open: 

★ show that Wj/2 > 2/3; 

★ is m a — 1/(1 + a) for some a > 0? 

Finally, we point out that a reduced (simplified) copolymer model was introduced 
in 1 4 1, taking inspiration from the approach in (23]. The original hope was that this 
simplified model could catch the main features of the original copolymer model. 
However, this does not seem to be the case, since it has been shown l30l l6ll that for 
the reduced model one has m a = 1/(1 + a) for all a € (0, 1). 



1.5 The critical behavior and a word about pinning models 

Claims can be found in the physical literature about the critical behavior of this 
model (at least in the original set-up, a = 1/2, e.g. ifTUl I2T1 l23l l32l ). but these 
claims do not always agree with each other, apart for the fact that they all claim, 
not surprisingly, a smoothing effect of disorder. A rigorous result available on this 
issue has been proven in |[T9l : the transition of the general copolymer model is 
smooth (the derivative of the free energy vanishes at least linearly when the critical 
point is approached, hence it is at least Lipschitz continuous at the critical point), 
in contrast with the annealed case (where the the derivative of the free energy has 
a discontinuity at criticality, cf. Remark [3JI: for every X > there exists c(A) < °° 
such that 

f(AA(A) + <5) < (l + a)c(A)5 2 (20) 

for every 8 > 0. The result was obtained under some technical conditions on the 
disorder law, which are satisfied for instance in the case of Gaussian or bounded 
charges. One can of course wonder whether the critical behavior of the copolymer 
model depends or not on a (and on A?) and how, but once again, the substantial 
lack of sound physical predictions is quite disappointing. A natural open question 
however is: 
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* Can one improve §20\ , in the sense of replacing the exponent 2 with a larger 
value? 



A somewhat deeper insight into this issue can be achieved by considering also an- 
other class of models, as we explain next. 

The bound ( |2"0] > in fact coincides with the one available for disordered pinning 
models. Pinning models are a close companion to the copolymer, since the Boltz- 
mann factor of a pinning model is 



where 8 n = Inez, that is ls„=o m tne random walk set-up (co is chosen as before, 
so that fi(D„ +h is a random variable of mean Agl and variance /3 2 ). Therefore in 
this case the polymer has an interaction with the environment only when it touches 
the oil-water interface (or simply when it touches the x axis, usually called defect 
line, since the model does not depend on the sign of the excursions). It is well 
known that pining models exhibit a localization transition too and they can be dealt, 
to a certain extent, with similar techniques |[T5l . However, in the end, there are 
considerable differences, but let us try to single them out and see what they suggest 
for copolymers: 

1. The annealed pinning model is much richer than the annealed copolymer (cf. Re- 
mark]^. In particular, the annealed free energy does depend on K(-) and the crit- 
ical behavior depends on a: the transition is continuous (that is, the free energy 
is C 1 ) as soon as a < 1 and it becomes smoother and smoother as a approaches 
lfl5ll . Harris criterion (see references in [15] [16)) gives a precise prediction on 
what to expect for systems for which the annealed system has a transition that is 
sufficiently smooth (for the pinning case the criterion boils down to a < 1 /2): 
essentially it says that quenched and annealed systems have the same critical be- 
havior and it gives a precise prediction of the shift in the critical point due to 
the disorder (irrelevant disorder regime). At the same time it suggests/predicts 
that for a > 1/2 disorder is relevant, even if arbitrarily weak. This scenario has 
now been made rigorous, see ifTBI and references therein, for pinning models. 
The crucial point for us is however the fact that the free energy of the annealed 
copolymer model is not differentiable at the critical point and therefore, in the 
Harris sense, disorder is always relevant. 

2. Understanding critical phenomena when disorder is relevant is a major challenge 
and the possible scenarios set forth in the physical literature are quite intriguing, 
but very challenging and, at times, controversial (see e.g. [24 33 1 and references 
therein). In this sense also the question that we have raised about improving |20} 
acquires particular importance. 

3. When a > 1, also the annealed pinning model free energy is not C , and the 
critical curve has been identified with no more precision than for the copolymer 
model. In fact the annealed pinning critical curve (again, a curve separating lo- 
calized and delocalized regimes, in the (j3 , h) plane) behaves like — j3 2 /2 when j3 




(21) 
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is small and the quenched critical curve is in [— c+/3 2 , — c_j3 2 ] for j3 small (with 
explicit values of the constants < c_ < c + < 1/2, cf. ifTTlD . This is absolutely 
parallel to the fact that for the copolymer model h c {X) G [c_A, c+A], as one reads 
for example out of < fT~8] >, ( fT9] ). 

Finally, it is natural to wonder what happens when a pinning interaction is added 
to the copolymer model, that is when not only the solvents are selective, but some- 
thing special goes on at the interface (thus taking into account for example the lack 
of sharpness of the interface or the fact that impurities could be trapped at the inter- 
face). There are works on this model, often called copolymer with adsorption (see 
for example ||251 l27l l34l ). but the understanding is very limited: we refer to lfT31 
§ 6.3.2] for a detailed overview on this issue. 



1.6 Organisation of the paper 

The rest of the paper is devoted to going deeper into various results that we have 
stated, by giving either a sketch of arguments of proof, or alternative proofs and 
results that complement what can be found in the literature. More precisely: 

• In section 2 we give a sketch of the proof on the lower bound in ( |17) . 

• In section 3 we give an alternative proof of the upper bound in ( fTTj i. 
In section]4|we discuss the universality features of the copolymer model in the 
weak coupling regime, i.e., for small values of X, h. 

Finally, section[5]is devoted to the description of the available results on the path 
properties of the copolymer model. 



2 Localization estimates 

The aim of this section is to give a sketch of the proof of the lower bound in Theo- 
rem[T] that is of the left inequality in ( |17) , as well as of the left inequality in (19) . 

The key-phrase for the approaches in this section is: rare stretch strategies. The 
idea, inspired by the renormalization group approach in J23), is to restrict the par- 
tition function to polymer trajectories that can visit the lower half -plane only when 
there is a stretch of monomers that are particularly, and anomalously, hydrophilic. 
To do this we introduce an intermediate scale I (large, but fixed) and look at the 
sequence of charges in blocks of I charges at a time. We will consider two strategies 
(A and B) and, for simplicity, we will assume d>\ ~ ^(0, 1): 

A: the / h block is good if Y/ n= i --im+i + ^) — — m ^> v/ith m a positive value to 
be chosen below. For I large, the probability that a given block is good is very 
small, about exp(— l(h + m) 2 /2), so that good blocks are typically separated by 
a distance of about exp(£(h + m) 2 /2). 
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B: the / h block is good if Y^n-iv+i C ^ = o{i). For t large, the probability 
that a given block is good is again very small for h > 0, about exp(— £h 2 /2), 
so that good blocks in this case are typically separated by a distance of about 

exp{£h 2 /2). 
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Fig. 3 The lower bound strategies presented in this section are two and they are both based on 
selecting some good blocks, just by looking at the environment. They are actually blocks in which 
the environment is atypically negative: in strategy A we really select blocks in which the empirical 
average of the charges (O n + h is smaller that a value —m < (selected in the end, in order to 
maximize the gain) and in strategy B we just aim at an empirical average close to zero (a rare 
event anyway, for h > 0). Then for strategy A we make a lower bound on the partition function by 
visiting the lower half plane if and only if a block is good and by insisting that in a good block the 
walk stays in the lower half plane. In strategy B we target again the good blocks, but we put no 
constraint on the walk in the good blocks. 



Given a sequence CO of charges, the good blocks are identified by the rules just 
given and we introduce a set of polymer trajectories £2^ m , resp. £2^ m , for the strat- 
egy A, resp. B, defined as follows. Q^j a, lS the set of trajectories that stay in the 
upper half plane except in presence ofgood blocks, when they stay in the lower 
half plane, see the upper part of Figurepj The set of polymer trajectories £2^ m st iU 
includes all trajectories that stay in the upper half plane except in presence of good 
blocks, but when there is a good block the only restriction is that the polymer has 
to touch the oil-water interface just before every good block and it has to touch it 
again at the end of the block, see the lower part of Figure [3] 

The estimates are now just based on observing that 



Zn.g> > E 



exp I -2k £ A n (co„ + h) \ ; £2$ >0 



K{hl) 



(>'l+l) 



exp —2k (°>n + h) 



n=i,(+l 



K(£) 



K((i 2 -h)£) ( , {h ^ )l , \ Kit) 

U2 U U xp [-2k £ (CDn + h) 



(22) 



n=i 2 (+l 
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where the first good block is the /* and so on. By using the definition of £2^ m 

we see that each term — 2A L„£|~.2h fan + h) lS bounded below by 2Xml and we 
notice that the right-hand side of ( |22| > is a product of terms that, typically and to 
leading order, are just the same term, because the distance of good block is about 
e\p(l(h + m) 2 /2) . Therefore this product of terms will give origin to an exponential 
growth in N (localization!) if 

- -Y^ £ ( h + m ) 2 + 2Um + ° W > ( 23 ) 

(we have of course used (|3]l). If we now optimize the choice of m we readily see that 
this condition is met if h < A/(l + a), for £ sufficiently large. Therefore h c (X) > 
A/(l + a), which is the lower bound in ( fT7] > for the special case of Gaussian charges 
(the extension to general disorder is straightforward). 

Strategy B exploits a factorization similar to p2| but this time the contribution 
in the good blocks is of about exp(F(A,0)^), since in the good blocks the empirical 
average of the charges is zero, so the charges are essentially centered and for large 
£ the random variables in a good block have a distribution that is close to centered 
IID standard Gaussian variables (in the complete proof, cf. J3] Sec. 2], this step 
is performed via a change of measure argument that makes rigorous the heuristics 
presented here: the argument is slightly more involved for non Gaussian charges). 
The analog of ( |23] l in this case becomes 

-^^£h 2 + v(X,0)£ + o(£). (24) 

At this point we need to estimate f(A , 0). For example if one can show that f(A, 0) > 
cA 2 for come c > (say, for X < Ao) then for the same values of A one would have 
that h c (X) > X\j2cj{\ + a). This is the basic idea leading to the (middle) lower 
bound in $19\ . Of course the work is now on estimating c. We will not go into this 
issue which, ultimately, is a refinement of the result in ll26l and we refer to UJ Sec. 
2] for details. But we point out that 

• as explained in section|4] one can show that f(A , 0)/A 2 has a positive limit that 
can be expressed in term of the free energy of a continuum polymer; 

• in order to improve on the lower bound in ( fT8| one needs \Jlcj(\ + a) > 1/(1 + 
a), that is c > 1/(2 + 2a). This can be established, as recalled just below ( [19) , for 
a > 0.65. If we were to improve on the lower bound in ( fT8| l with this strategy for 
a = 1/2 we would need to show c > 1/3, but numerical estimations suggest that 
lim^^ F(A,0) /A 2 is smaller (probably by little) than 1/3, so it is very likely that 
this strategy (barely) fails to establish that the lower bound in ( fT8] l can be made 
strict for a = 1 /2. The interest on this issue is because it would prove that the 
claims in 1 23 , 28 1 are not correct. 
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3 Derealization estimates 

In this section we address the upper bound in Theorem[T[ that is the right inequality 
in (17) . We will actually present (in full) an argument that is substantially easier 
than the one originally used (5), even if it has the drawback to work only for a £ 
(0, 1). This argument is still based on the fractional moment method (first used in 
the copolymer context in [ 30] to show that the upper bound in ( flT) holds for X large 
and unbounded disorder), but it avoids the change of measure argument used in [5]. 
The change of measure argument in J5J fTTTl is an important technique, as well as 
its refinement in OTl that leads to the upper bound in ( fT9) , but we will not discuss 
these techniques here. 



3.1 Fractional moment method: the general principle 

We consider the fractional moment method in its most elementary application. For 
the constrained partition function |8]l we can write 

N k i , -2A(E,._ 1< ^< (j £B/+ftft— 

W=E I Y[~ 2 (25) 

k=10=t <t l <...<t k =Ni=l 

For 7 € [0,1], from the inequality (a + b) r < a r + valid for all a, b > 0, we obtain 
the upper bound 

H&Nm) 7 ) < I E t[Ky,X,h{n-ti-l), (26) 

k=l 0=t <t l <...<t k =N i=l 

where we define 

~ i i „— (2Ayft— logM(— 2Ay))n 

Ky.x.h{n)-= ^ ^ ( " )r ' (2?) 

We also set 

neN 

Assume that E(y,k,h) < 1. By classical renewal theory [2|, the right hand side 
of ( f26) equals the probability that a renewal process with step probability (or sub- 
probability, if Z(y,A,/z) < 1) Kyx.h{') passes through A^; in particular, it is bounded 
by 1 . Then 
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whence f(A , h) = by ( fTTj ). 

This means that, when there exists y £ [0, 1] such that E (7, A, h) < 1, it follows 
that (A,/?) g £?. This allows to give explicit estimates on the delicalized region. 
Note that for 7 = 1 we find the annealed delocalized regime, that we have already 
introduced: in fact E(l,X,h) < 1 when h > hf n (X) := logM(-2A)/(2A). Since 
for 7 € [0,1/(1 + Of)] one sees immediately that .£(7, A,/?) = +°°, the interesting 
range is y£ (l/(l + a),l). 



3.2 Fractional moment method: application 

Let us define for A > 

h(X):=mf{h>0: 3y£ [0, 1] such that E(y,X,h) < 1} . (29) 

Proposition 1. For a £ (0,1) we have h c (X) < h(X) < h c " m (X)for every A > 0. 

Proof. We have just remarked that E(y,X,h) < 1 implies (A, h) £ '3, therefore 
h c (X) < h(X). It remains to show that h(X) < h" nn (X) for every A > 0, that is, for 
e > sufficiently small we can choose y£ [0, 1] such that E(y, A, hf n (A) — e) < 1. 
Note that for y£ (0, 1) 

d y 1 , -(ylogM(-2A)-logM(-:Uy))n 
^(7,ACW)=I — » (logK(n))K(ny 



-2A 



(lo g My<-2A) + l ^tMl 



-(ylogM(-2A)-logM(-2Ay))n 

I , "W- 

nGN Z 



By the strict convexity of logM(-) and the fact that logM(0) = 0, 

(■0gM)-(-2X) < '^ M "" -^ M '-"» - . (30, 

Recalig our assumption (|3), for a £ (0, 1) we have LneN n ^( n ) — °°> therefore by 
Fatou's lemma 

^(r,A,Ar(A)):=Um^(7,A,Ar(A)) = +oo. (31) 

SinceX(l,A,/zf"(A)) = 1, it follows that Z(l - 77, A, /if "(A)) < 1, for 77 > small 
enough. By continuity, E(l — r\,X,h" nn (X) — e) < 1 for £ small enough, and the 
proof is completed. □ 



16 



Francesco Caravenna, Giambattista Giacomin and Fabio Lucio Toninelli 



4 Continuum model and weak coupling limit 



In this section we explain in some detail the universality feature sketched in Figure|2] 
and its caption. The idea is that at weak coupling the details of the model, that is the 
law of the renewal beyond the exponent a and the law of the disorder, are inessential 
and a suitable continuum model captures the leading behavior of the (large class of) 
discrete models we consider. 

As we already remarked, it is convenient to look at the renewal process T = 
{l*}jfc>0 as a random subset of [0,°°). It follows from our assumption ([3]) that the 
rescaled random set er = {£Tk}k>o converges in distribution as e \ toward a limit 
random set T a , the so-called a-stable regenerative set (we refer to lfl2l for more 
details; cf. also 1 8 ]). This is a random closed subset of [0, °°) which is scale-invariant 
(cT a has the same law as T a , for every c > 0), has zero Lebesgue measure and no 
isolated points. For a = \ we have the representation T a = {t € [0,°°) : B t = 0}, 
where B is Browian motion. 

The complementary set (?")' , being open, is the countable union of disjoint open 
intervals {I n } ne n- We can then define a continuous -time process A a = {A"} te p^, 
which is constant on each /„ and takes the value or 1, decided by fair coin tossing: 
more precisely, in analogy with the discrete case, we set A, := ^neH (0 where 
{^n}neN are i-i-d. B(j) random variables. For a = \ we have the representation 
A" = 1{£,<o} with B a Brownian motion. In general, A a may be viewed as the limit 
in distribution of the rescaled discrete process {4^/ £ j } fG [o,oo) as e \ 0. 

Let now (j3 = {fit}te[Q,°°)^) be a Brownian motion, independent of (A a ,P). We 
proceed for a moment in a somewhat informal way: as a \ 0, a~ l (Oy t / a i^ converges 
toward the white noise d/3 r /df and ^[ t / a 2 \ converges toward A,, therefore 

N/a 1 N N 

ak (ton +ah)A n = A / (a _1 0)^/ a 2j +h)A[ t / a 2j 6t w A / (dj3, +hdt)A" . 
This hints at introducing a continuum partition function 



Z t,j3 = Z t,P,X,h '■— E 



exp -2A / A?(dp s +hds) 







(32) 



so that, recalling |5]), one should have Z N / a 2 m a ^ ah ~ Z N ^ x h for a small. 

We now turn to precise statements. One can show that the definition ([32) of the 
continuum partition function is well-posed, for P-a.e. j3, and one introduces the 
corresponding continuum free energy F a (A,/i) in the usual way: 

v a (X,h) = Um^Elog^^. (33) 
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The existence of this limit and the fact that it is self-averaging (i.e., the expecta- 
tion E can be dropped) require a much longer and technical proof than the discrete 
counterpart, cf. (8). 

Also the continuum free energy is non-negative: F a (X,h) > for all X,h > 0, as 
one can easily check. The localized and delocalized regimes can therefore be defined 
in analogy with the discrete case, namely (A , h) £ Jz? (resp. (A , h) 6 if F a (A , h) > 
(risp. F a (X,h) = 0), and it is easily shown that they are separated by a critical 
curve: <£= {(X,h) : h < h%{X)} and 9 = {(X,h) : h > h?(X)}. There is however 
a major simplification with respect to the discrete case: the scaling properties of 
the processes A a and yield easily F a (aX,ah) = a 2 F a (X,h) for all X 7 h,a > 0, 
therefore the critical curve is a straight line: h®(X) = m a X for some m a . 

We can finally come back to the rough consideration Z N j a 2 ma x „/, ~ ^N,fi,X,h^ 
that was discussed above. This can be made precise in the form of the following 
theorem. 

Theorem 2. For an arbitrary discrete a-copolymer model we have 

lim \i>(aX,ah) = F a (X,h), VA,/!>0. (34) 



a\0 a A 



Moreover 



lira — — = m a . (35) 

A\0 X 

This result was first proved in [7| in the special case of the basic model of sec- 
tion[T] i.e., for the discrete copolymer model based on the simple random walk on 
Z, corresponding to a = \ (in fT71 one can find an argument to relax the assumption 
in |7| of binary charges and in [25] the case with adsorption is treated, cf. the end 



of section 1.5 1. The generalization to arbitrary a-copolymer models, with general 
disorder distribution, is in JH). 

Note that p4| yields directly the existence of the limit as A \ of f(A,0)/A 2 , 
that was anticipated in section [2j as well as the fact that this limit coincides with 
F a (l,0) > 0. We also point out that ( |35| ) is not a direct consequence of (|34j. 

The importance of Theorem [2] relies in its universality content: for any fixed 
a € (0, 1) there is a single continuum model that captures the behavior of all dis- 
crete a-copolymer models for small values of A and h. In other words, the differ- 
ences among these models become irrelevant in the weak coupling limit. From this 
viewpoint, the slope m a of the continuum critical curve is an extremely interest- 
ing object: improving the known bounds < m a < 1 would yield a substantial 
improvement in the understanding of the phase transition in this class of models. 



5 Path properties 



Up to now we have discussed the localization-delocalization transition only in terms 
of free energy. A complementary, and equally interesting, point of view is that of 
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looking at path properties. In other words, how does the typical (under Pjv,ro, for 
typical co) polymer trajectory look like? The bottom-line of the picture which has 
emerged up to this day is the following. In the localized region the polymer makes 
order of N excursions between the two half-planes; the lengths of such excursions 
are 0(1) and their distribution has an exponential tail. In the delocalized region, on 
the other hand, the number of monomers in the defavorable solvent (i.e. in the lower 
half plane) is not only sub-linear in N (this information can be obtained immediately 
from the fact that the free energy is zero there, cf. ( fT2| >) but it actually does not 
exceed (log AT), with high probability. In the following, we discuss this picture in 
a bit more detail. 



5.1 The localized phase 

This subsection is is extracted from ITSl . to which we refer for additional results, 
concerning for instance the exponential tail of the length of the polymer excursions 
between the two solvents. Path properties in the localized phase have been studied 
also in JIIO. 

Let Mat := max, :Ti <Ar(T/ — T,-_i) be the length of the longest polymer excursion 
between the two solvents. The following result says that in the localized region 
correlations decay exponentially fast, and the longest excursion is of order logA^: 

Theorem 3. Let (A,/z) € _Sf . There exist constants c\,C2 such that, for every pair of 
bounded local functions A, B of X we have 

supE[|E Wifi) (AB)-E J v, £B (A)E JV!£0 (B)|]<c 1 ||A||« J || J B|| oo e- c ^ B ), (36) 

N 

where d(A.B) denotes the distance between the supports of A and B. 
Moreover, for every ££ (0, 1) the following holds in V -probability: 

limP„. ffl (^<^<J-±^=l, (37) 
am-~ \fi(X,h) logN fi(X,h) J 

where jti is defined as 

1 l + e -2^ =l ((0„+h) 

ii{X,h) = - lim -logE . (38) 



The existence of the limit ( |38) , together with the bounds < jj.(X,h) < v(X,h) in 
the localized phase {ji(X,h) > in general, and this is seen in the same way as for 
f(A,/z) > 0), is proven in fl8l . where one can also find an argument showing that 
jj.(k,h) < v(X,K) under suitable (but not too restrictive) assumptions on the law of 
the charges. 

As a simple consequence of the exponential decay of correlations one can prove 
that 
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1 . the free energy is infinitely differentiable (in both X and h) in the localized region 

if; 

2. for every bounded local observable A the limit lim^^ooETv , ffl (A) exists P(do) 
almost surely and is reached exponentially fast. 

As expected, the rate of exponential decay of the correlation functions (or inverse 
correlation length), i.e. C2 in ( f3"6] >, tends to zero as h /*• h c (X). In general, it is a very 
interesting open problem to understand the relation between this and the way the 
free energy vanishes close to the critical point. In 11291 , for the special case where 
K(-) is the law of the first return to zero of the symmetric simple random walk on 
Z, it was proven that the best constant C2 coincides with fi(X,h) defined in ( [38] l. 

There would be much to say about fi(X,h) and F(X,h) and we prefer to refer 
the reader to the introduction of l20l where this issue is treated in detail for pinning 
models. Here, in a simplistic way, we just point out that Theorem[3]and the bounds 
mentioned just after it are telling us in particular that jj,(X,h) is as good as F(X,h) 
for detecting the localization transition. But: 

★ is it true that logfi(X,h) ~ logF(X,h) as h \ h c (X)l In view of the discussion 
on disorder relevance in section [T3] we expect that this is not the case and estab- 
lishing such a result would be very interesting. 



5.2 The delocalized phase 

Recall the definition Jifa = L^ =1 4„ in (jT3]l. The following theorem shows that, 
strictly inside the delocalized region, Jr$/ is typically at most of order logA^: 

Theorem 4 ( B17I ). For any 8 > 0, X > there exist c > 0,q > such that for every 

EP^aUAW+S^ > n) < e- c " V« > qlogN . (39) 

This result was proven in [ 17 1 under the assumption that the disorder law P sat- 
isfies a concentration inequality of sub-Gaussian type. This holds for instance in the 
case of Gaussian or bounded charges, and more generally whenever the distribution 
of o>i satisfies a Log-Sobolev inequality. 

Here we give a simpler argument, inspired by [22], which works under the gen- 
eral assumptions of section[T]on the disorder law and gives the weaker statement 

for some constant c. The same argument also shows that at the critical point JV^ is 
typically at most of order y/NlogN : 

EE w , ffl , W) (^) < c'^NlogN (41) 

for some other constant c'. 
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Recalling (BJ, for all ho,h\ > we can write 

^N,a,X,h := Z N,a,X,h\ ' E N,a>,XM t ex P ( 2A ( h l ~ ho)J^f)] ■ (42) 

Restricting Z N m x hl on the event {Ti > N, C,\ = 0}, we obtain the bound Z Nm xj n > 
jY.n>N K(n) ~ (const. )N~ a , by (j3j. Applying Jensen's inequality we obtain 

-j-E [logZ w . raa , Ao ] > ~E [\o E Z WM ] + 2X{hl ~ h0) EV N , m ,l. hl WW 

logjV 2A(M-Ap) 
> "CI H EEjv.a ^j, [^J , 

where cj € (0,»). By ^ we have logZ^^ > logZ N mA h - c 2 logN, for some 
constant c%, therefore by (JTOf we can write, for some c G (0,°°), 



F(A,*0) > sup {^^W^ [^] -c 1 ^ j . (44) 

• Take h Q = h c (X) and h\ = h c (X) + 8 with 5 > 0. Since F(X,h ) = 0, from $fy 
we obtain ( pO) . 

• Now take /?o = /i c (A) - 5, with 5 > 0, and h\ = h c (X). Since p(X,h c (X) — S)< 
C28 2 for some C2 — c 2 (X) € (0,°°) (cf. (|20]>), it follows again from |44} that 

hmsup ' < C, where C := ^ — , G (0,°°) , (45) 

^/NlogN A 

whence \A\) . 

Remark 5. Recalling ( |27] >, let A , /; > and 7 G [0, 1] be chosen such that L(y, X, h) < 
1, that is, ^x.ftO) ls a sub-probability kernel on No. Since Ky.x.h(') ~ cK(N)V ~ 
c'jy-rU+a) as — > 00 for some constants c,c' > 0, by ( |27"] > and ([3]), it is a basic 
result in renewal theory that the right hand side of ( |26l > is asymptotically equivalent 
to c' Ky i h (N) as AT — > °° for some constant c" > 0, cf. lfT5l Theorem A. 4]. Therefore 
we have for all N G N 

E ((Z c N , mX h) 7 ) < CiK rA . h (N) < C, ATW, (46) 

where here and in the sequel C; denotes a generic positive constant. Recalling Q 
and (BJ, for the original (non constrained) partition function we have 

E ((Z N , m ,x,h) 7 ) < C 2 N?K(Ny < C,N-y a . (47) 



This relation can be exploited to improve Theorem|4] showing that with high prob- 
ability jVn is of order 1. More precisely, from the bound Z Nl0 x.h > 5 Y,n>NK(n) ~ 
(const. )N~ a and ( |42] > it follows that for any 8 > 
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^N.coXh+s [exp {2X8 Jfr)] = ^ NMXJl < C 4 N a Z N , aXh , (48) 

hence applying ( |47] i we obtain 

E [(E N , mXh+s [exp (2A5^)])^ < CjiV^E ((Z^^) 7 ) < C 5 . (49) 
Recalling that 7 € [0, 1], by Markov's inequality we can write 

EP NtaXh+S {^N > n) < E (P N , m .X.h+s(^N > nf) 

< e- 2X5 r"E [(E N , mXh+5 [exp(2A5^v)])^ < C^ 2 ^ . 

Summarizing, we have shown that whenever relation ( |47| > holds true, there exists 
a constant C > such that for every 5 > 

EP N ^ Xh+s (^Y N >n)<Ce- ns ^, V»eN. (50) 

We stress that relation ( |4"7| ) holds true in particular for every X , h with h> h(X) (with 
a suitable choice of 7 6 [0, 1], recall ( |2"9| > and Proposition [TJ, hence also below the 
annealed critical curve. This is therefore an improvement of equation (1.12) in flD . 

We point out that derealization properties were also studied in [3]. However the 
nature of the delocalized phase, in the pathwise sense, is still very little understood 
and, notably, almost sure results are laking. For example: 

* Is it true that, if {k, h) is in the interior of @, for every e > we have 
lim A r^ cP w , ffl (4„ = for every n € [eA^,A^] nN) = P(do)-almost surely? 
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